Picture for Shuyue Stella Li

Shuyue Stella Li

BLAB: Brutally Long Audio Bench

Add code
May 05, 2025
Viaarxiv icon

A False Sense of Privacy: Evaluating Textual Data Sanitization Beyond Surface-level Privacy Leakage

Add code
Apr 28, 2025
Viaarxiv icon

Aligning LLMs to Ask Good Questions A Case Study in Clinical Reasoning

Add code
Feb 20, 2025
Viaarxiv icon

CulturalBench: a Robust, Diverse and Challenging Benchmark on Measuring the (Lack of) Cultural Knowledge of LLMs

Add code
Oct 03, 2024
Figure 1 for CulturalBench: a Robust, Diverse and Challenging Benchmark on Measuring the (Lack of) Cultural Knowledge of LLMs
Figure 2 for CulturalBench: a Robust, Diverse and Challenging Benchmark on Measuring the (Lack of) Cultural Knowledge of LLMs
Figure 3 for CulturalBench: a Robust, Diverse and Challenging Benchmark on Measuring the (Lack of) Cultural Knowledge of LLMs
Figure 4 for CulturalBench: a Robust, Diverse and Challenging Benchmark on Measuring the (Lack of) Cultural Knowledge of LLMs
Viaarxiv icon

ValueScope: Unveiling Implicit Norms and Values via Return Potential Model of Social Interactions

Add code
Jul 02, 2024
Figure 1 for ValueScope: Unveiling Implicit Norms and Values via Return Potential Model of Social Interactions
Figure 2 for ValueScope: Unveiling Implicit Norms and Values via Return Potential Model of Social Interactions
Figure 3 for ValueScope: Unveiling Implicit Norms and Values via Return Potential Model of Social Interactions
Figure 4 for ValueScope: Unveiling Implicit Norms and Values via Return Potential Model of Social Interactions
Viaarxiv icon

Teaching LLMs to Abstain across Languages via Multilingual Feedback

Add code
Jun 22, 2024
Figure 1 for Teaching LLMs to Abstain across Languages via Multilingual Feedback
Figure 2 for Teaching LLMs to Abstain across Languages via Multilingual Feedback
Figure 3 for Teaching LLMs to Abstain across Languages via Multilingual Feedback
Figure 4 for Teaching LLMs to Abstain across Languages via Multilingual Feedback
Viaarxiv icon

MEDIQ: Question-Asking LLMs for Adaptive and Reliable Clinical Reasoning

Add code
Jun 04, 2024
Figure 1 for MEDIQ: Question-Asking LLMs for Adaptive and Reliable Clinical Reasoning
Figure 2 for MEDIQ: Question-Asking LLMs for Adaptive and Reliable Clinical Reasoning
Figure 3 for MEDIQ: Question-Asking LLMs for Adaptive and Reliable Clinical Reasoning
Figure 4 for MEDIQ: Question-Asking LLMs for Adaptive and Reliable Clinical Reasoning
Viaarxiv icon

CulturalTeaming: AI-Assisted Interactive Red-Teaming for Challenging LLMs' (Lack of) Multicultural Knowledge

Add code
Apr 10, 2024
Figure 1 for CulturalTeaming: AI-Assisted Interactive Red-Teaming for Challenging LLMs' (Lack of) Multicultural Knowledge
Figure 2 for CulturalTeaming: AI-Assisted Interactive Red-Teaming for Challenging LLMs' (Lack of) Multicultural Knowledge
Figure 3 for CulturalTeaming: AI-Assisted Interactive Red-Teaming for Challenging LLMs' (Lack of) Multicultural Knowledge
Figure 4 for CulturalTeaming: AI-Assisted Interactive Red-Teaming for Challenging LLMs' (Lack of) Multicultural Knowledge
Viaarxiv icon

A Quantitative Approach to Understand Self-Supervised Models as Cross-lingual Feature Extractors

Add code
Nov 27, 2023
Viaarxiv icon

Narrowing the Gap between Zero- and Few-shot Machine Translation by Matching Styles

Add code
Nov 04, 2023
Viaarxiv icon